Document Ranking for Effectiveness-Efficiency Tradeoffs

نویسندگان

  • Vo Ngoc Anh
  • Alistair Moffat
چکیده

Method: Building a document ranking system involves two key decisions: choosing a retrieval model, and choosing a suitable index representation. The former determines the effectiveness of the system, the latter the efficiency; and each of them affects the other. The impact-based document ranking mechanism described by Anh and Moffat [2] was chosen for our system because of its balance between effectiveness and efficiency. In terms of effectiveness, it is highly competitive, although still inferior to advanced language modelling implementations. On the other hand, in terms of efficiency the mechanism is excellent, as it ranks documents using a small number of calculations, all on integer numbers. To further facilitate query efficiency, we compress the index using the slide-8 coding scheme [1], which allows an excellent balance between compression ratio and decoding speed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Post-Learning Optimization of Tree Ensembles

Learning to Rank (LtR) is the machine learning method of choice for producing highly effective ranking functions. However, efficiency and effectiveness are two competing forces and trading off effectiveness for meeting efficiency constraints typical of production systems is one of the most urgent issues. This extended abstract shortly summarizes the work in [4] proposing CLEaVER, a new framewor...

متن کامل

Title of dissertation : LEARNING TO EFFICIENTLY RANK

Title of dissertation: LEARNING TO EFFICIENTLY RANK Lidan Wang, Doctor of Philosophy, 2012 Dissertation directed by: Associate Professor Jimmy Lin (primary advisor) Professor Douglas Oard (co-advisor) Department of Computer Science and College of Information Studies Web search engines allow users to find information on almost any topic imaginable. To be successful, a search engine must return r...

متن کامل

Privacy and Efficiency Tradeoffs for Multiword Top K Search with Linear Additive Rank Scoring

This paper proposes a private ranking scheme with linear additive scoring for efficient top K keyword search on modest-sized cloud datasets. This scheme strikes for tradeoffs between privacy and efficiency by proposing single-round client-server collaboration with server-side partial ranking based on blinded feature weights with random masks. Client-side preprocessing includes query decompositi...

متن کامل

Ranking Efficient DMUs in Two-stage Network DEA with Common Weights method

Two stages DEA models are used in many fields of management and industry. One of the concepts that has attracted the attention of researchers in the theory of production is the concept of ranking the units with a two-stage network. A unit ranking can provide useful information to decision makers (DMUs) about optimal decision making activities. This concept defines the superiority of a unit in t...

متن کامل

An Ensemble Click Model for Web Document Ranking

Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005